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Abstract 

We investigate the most general phase space of configurations, consisting of the collection 
of all possible ways of assigning elementary attributes, "energies" , to elementary positions, 
"cells" . We discuss how this space defines a "universe" with a structure that can be approx- 
imately described by a quantum-relativistic physical scenario in three space dimensions. In 
particular, we discuss how the Heisenberg's Uncertainty and the bound on the speed of light 
arise, and what kind of mechanics rules on this space. 
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1 Introduction 

The search for a unified description of quantum mechanics and general relativity, within a 
theory that should possibly describe also the evolution of the universe, is one of the long 
standing and debated open problems of modern theoretical physics. The hope is that, once 
such a theory has been found, it will open us a new perspective from which to approach, 
if not really answer, the fundamental question behind all that, that is "why the universe is 
what it is". On the other hand, it is not automatic that, once such a unified theory has been 
found, it gives us also more insight on the reasons why the theory is what it is, namely, why it 
has to be precisely that one, and why no other choice could work. But perhaps it is precisely 
going first through this question that it is possible to make progress in trying to solve the 
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starting problem, namely the one of unifying quantum mechanics and relativity. Indeed, 
after all we don't know why do we need quantum mechanics, and relativity, or, equivalently, 
why the speed of light is a universal constant, or why there is the Heisenberg Uncertainty 
We simply know that, in a certain regime, Quantum Mechanics and Relativity work well in 
describing physical phenomena. 

In this work, we approach the problem from a different perspective. We do not assume 
quantum mechanics, nor relativity. The question we start with can be formulated as follows: 
is it possible that the physical world, as we see it, doesn't proceed from a "selection" principle, 
whatever this can be, but it is just the collection of all the possible "configurations", intended 
in the most general meaning? May the history of the Universe be viewed somehow as a 
path through these configurations, and what we call time ordering an ordering through the 
inclusion of sets, so that the universe at a certain time is characterized by its containing as 
subsets all previous configurations, whereas configurations which are not contained belong 
to the future of the Universe? What is the meaning of "configuration" , and how are then 
characterized configurations, in order to say which one is contained and which not? How do 
they contribute to build up what we observe? 

Let us consider the most general possible phase space of "spaces of codes of information" . 
By this we mean products of spaces carrying strings of information of the type "1" or "0" 
(we will comment at the end of the paper about the generality of the choice of working with 
binary codes). If we interpret these as occupation numbers for cells that may bear or not a 
unit of energy, we can view the set of these codes as the set of assignments of a map \l/ from a 
space of unit energy cells to a discrete target vector space, that can be of any dimensionality. 
If we appropriately introduce units of length and energy, we may ask what is the geometry 
of any of these spaces. Once provided with this interpretation, it is clear that the problem 
of classifying all possible information codes can be viewed as a classification of the possible 
geometries of space, of any possible dimension. If we consider the set of all these spaces, 
i.e. the set of all maps, {^}, that we call the phase space of all maps, we may also ask 
whether some geometries occur more or less often in this phase space. In particular, we may 
ask this question about {^(E)}, the set of all maps which assign a finite amount of energy 
units, N = E. The frequency by which these spaces occur depends on the combinatorics of 
the energy assignments Q. Indeed, it turns out that not only there are configurations which 
occur more often than other ones, but that there are no two configurations with the same 
weight. If we call {^(E)} the "universe" at "energy" E, we can see that we can assign a 
time ordering in a natural way, because {^>(E')} "contains" {^>(E)} if E' > E, in the sense 
that V$ G 3*' e {$(E')} such that \P C E plays therefore the role of a time 

parameter, that we can call the age of the universe, T . Our fundamental assumption is that, 
at any time E, there is no "selected" geometry of the universe: the universe as it appears is 
given by the superposition of all possible geometries. Namely, we assume that the partition 
function of the universe, i.e. the function through which all observables are computed, is 

1 In order to unambiguously define these frequencies, it is necessary to make a "regularization" of the 
phase space by imposing to work at finite volume. This condition can then be relaxed once a rcgularization- 
independent prescription for the computation of observables is introduced. 
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given by: 

Z{E) = J2e SmE)) , (1-1) 

9(E) 

where S(*f>) is the entropy of the configuration \l/ in the phase space {^}, related to the weight 
of occupation in the phase space W / (\E f ) in the usual way: S = log W . Rather evidently, the 
sum is dominated by the configurations of highest entropy. The most recurrent geometries 
of this universe turn out to be those corresponding to three dimensions. Not only, but 
the very dominant configuration is the one that, in the continuum limit, corresponds to a 
three-sphere of radius R proportional to E. That is, a black hole-like universe in which the 
energy density is ~ In this scenario there is basically no free parameter 

except for the only running quantity, the age of the universe, in terms of which everything 
is computed. Out of the dominant configuration, a three-sphere, the contribution given by 
the other configurations to ( II. ip is responsible for the introduction of "inhomogeneities" in 
the universe. These are what gives rise to a varied spectrum of energy clusters, that we 
interpret as matter and fields evolving and interacting during a time evolution set by the 
.E-time-ordering. 

The most striking feature is that all these configurations summed up contribute for a 
correction to the total energy of the universe of the order of AE ~ 1/7". This is rather 
reminiscent of the inequality at the base of the Heisenberg Uncertainty Principle on which 
quantum mechanics is based on: T, the age/radius up to the horizon of observation, can 
also be written as At, the interval of time during which the universe of radius E has been 
produced. That means, the universe is mostly a classical space, plus a "smearing" that 
quantitatively corresponds to the Heisenberg uncertainty, AE ~ 1/At. This argument can 
be refined and applied to any observable one may define: all what we observe is given by a 
superposition of configurations and whatever value of observable quantity we can measure 
is smeared around, is given with a certain fuzziness, which corresponds to the Heisenberg's 
inequality. Indeed, a more detailed inspection of the geometries that arise in this scenario, 
the way "energy clusters" arise, their possible interpretation in terms of matter, particles etc. 
allows to conclude that 11.11 formally implies a quantum scenario, in which the Heisenberg 
Uncertainty receives a new interpretation. The Heisenberg uncertainty relation arises here 
as a way of accounting not simply for our ignorance about the observables, but for the 
ill-definedness of these quantities in themselves: all the observables that we may refer to 
a three-dimensional world, together with the three-dimensional space itself, exist only as 
"large scale" effects. Beyond a certain degree of accuracy they can neither be measured nor 
be defined. The space itself, with a well defined dimension and geometry, cannot be defined 
beyond a certain degree of accuracy either. This is due to the fact that the universe is not 
just given by one configuration, the dominant one, but by the superposition of all possible 
configurations, an infinite number, among which many (an infinite number too) don't even 
correspond to a three dimensional geometry. 

2 The radius of the black hole is the radius of the three-ball enclosed by the horizon surface. The radius of 
the three sphere does not coincide with the radius of the ball; they are anyway proportional to each other. 
How, and in which sense, a sphere can be said to have, like a ball, a boundary, which works as horizon, is a 
rather non-trivial fact related to the very special topology of this space, discussed in detail in Ref. [1] . 
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It is possible to show that the speed of expansion of the geometry of the dominant con- 
figuration of the universe, i.e. the speed of expansion of the radius of the three-dimensional 
black hole, that by convention and choice of units we can call "c", is also the maximal 
speed of propagation of coherent , i.e. non-dispersive, information. This can be shown to 
correspond to the bound of the speed of light (see Ref. P). Here it is essential that we 
are talking of coherent information, as tachyonic configurations also exist and contribute 
to 11.11 their contribution is collected under the Heisenberg uncertainty. One may also show 
that the geometry of geodesies in this space corresponds to the one generated by the en- 
ergy distribution. All this means that this framework "embeds" in itself special and general 
relativity. 

The dynamics implied by (11. ip is neither deterministic in the ordinary sense of causal 
evolution, nor probabilistic. At any age the universe is the superposition of all possible con- 
figurations, weighted by their "combinatorial" entropy in the phase space. According to our 
definition of time and time ordering, at any time the actual superposition of configurations 
does not depend on the superposition at a previous time, because the actual and the previous 
one trivially are the superposition of all the possible configurations at their time. Never- 
theless, on the large scale the flow of mean values through the time can be approximated 
by a smooth evolution that we can, up to a certain extent, parametrize through evolution 
equations. As it is not possible to exactly perform the sum of infinite terms of ll.l[ and it 
does not even make sense, because an infinite number of less entropic configurations don't 
even correspond to a description of the world in terms of three dimensions, it turns out to 
be convenient to accept for practical purposes a certain amount of unpredictability, intro- 
duce probability amplitudes and work in terms of the rules of quantum mechanics. These 
appear as precisely tuned to embed the uncertainty that we formally identified with the 
Heisenberg Uncertainty into a viable framework, which allows some control of the unknown, 
by endowing the uncertainty with a probabilistic interpretation. Within this theoretical 
framework, we can therefore give an argument for the necessity of a quantum description 
of the world: quantization appears to be a useful way of parametrizing the fact of being 
the observed reality a superposition of an infinite number of configurations. Once endowed 
with this interpretation, this scenario provides us with a theoretical framework that unifies 
quantum mechanics and relativity in a description that, basically, is neither of them: in this 
perspective, they turn out to be only approximations, valid in a certain limit, of a more 
comprehensive formulation. 

As discussed in pQ, the "spectrum" of the theory, namely the microscopical content of 
particles and their interactions, can be investigated via string theory tools. In this frame- 
work, String Theory arises as a consistent quantum theory of gravity and interacting fields 
and particles, which constitutes a useful mapping of the combinatorial problem of "distri- 
bution of energy along a target space" into a continuum space. Once so interpreted, it is no 
more a "free" theory. Like the physics implied by ll.l[ it is on the contrary highly predictive. 
Within this framework it is even possible to see its uniqueness [T] . For a detailed analysis of 
the spectrum of the theory implied by 11.11 and the phenomenological implications, we refer 
the reader to [2], [3] [I], and [5]. In particular, Refs. [3] and [5] show how this theoretical 
framework, being on its ground a new approach to quantum mechanics and phenomenology, 
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does not simply provide us with possible answers to problems which are traditionally referred 
to quantum gravity and string theory, like the computation of spectrum, masses and cou- 
plings of the elementary particles, but opens new perspectives about problems apparently 
pertaining to other domains of physics, such as (high temperature) superconductivity and 
evolutionary biology. 

2 The set-up 

2.1 Distributing degrees of freedom 

Consider a generic "multi-dimensional" space, consisting of Mf 1 x . . . x Mf* . . . x M£ n 
"elementary cells". Since an elementary, "unit" cell is basically a- dimensional, it makes 
sense to measure the volume of this p-dimensional space, p = ^2™Pi, in terms of unit cells: 
V = Mf 1 x ... x M% n . Although with the same volume, from the point of view of the 
combinatorics of cells and attributes this space is deeply different from a one-dimensional 
space with V cells. However, independently on the dimensionality, to such a space we can 
in any case assign, in the sense of "distribute", N "elementary" attributes, N < V. Indeed, 
in order to preserve the basic interpretation of the "JV" coordinate as "attributes" and the 
"M" degrees of freedom as "space" coordinates, to which attributes are assigned, it is nec- 
essary that N < M n , VnB What are these attributes? Cells, simply cells: our space is 
simply a mathematical structure of cells, and cells that we attribute in certain positions to 
cells. By doing so, we are constructing a discrete "function" y = f(x), where y runs in the 
"attributes" and x G {M® p } belongs to our p-dimensional space. We define the phase space 
{\I/(A r )} as the space of the assignments, the "maps" \1/: 

m : N ^ \\®Mf n , Mi > N. (2.1) 

i 

For large Mj and N, we can approximate the discrete degrees of freedom with continuous 
coordinates: Mj — > r i; N — » R. We have therefore a dimensional space with volume F] rf*, 

and a continuous map y G {R} A x G where R < Vz. In the following we will 

always consider Mj ^> N, while keeping V finite. This has to considered as a regularization 
condition, to be eventually relaxed by letting V — » oo. 

The assignments 12.11 are basically assignments of binary codes. However, if we call N 
the total energy, and the M space coordinates, it is clear that the *f>(N) are assignments 
of geometries, and that {\l/(iV)} is the phase space of all the possible geometries at energy 
N. To stay general, let us call them "configurations". In order to appropriately compare 
configurations through the corresponding geometries, we may think of fixing the highest 
dimensionality of space, say P, fix a volume Vp of this P-dimensional space @, and work 
with the subclass of configurations that correspond to spaces of dimension p < P, and 

3 In the case N > M n for some n, we must interchange the interpretation of the N as attributes and 
instead consider them as a space coordinate, whereas it is M n that are going to be seen as a coordinate of 
attributes. 

4 Indeed, P < V because it does not make sense to speak of a space direction with less than one space 
cell. 
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volume smaller than Vp. In this way, all the geometries can be thought as embedded in a 
common, higher space. P and Vp will then be let to go to infinity. 

We want to investigate now what is the entropy of a certain configuration in this phase 
space. An important observation is that there do not exist two configurations with the same 
entropy: if they have the same entropy, they are perceived as the same configuration. The 
reason is that we have a combinatoric problem, and, at fixed N, the volume of occupation in 
the phase space is related to the symmetry group of the configuration. In practice, we classify 
configurations through combinatorics: a configuration corresponds to a certain combinatoric 
group. Now, discrete groups with the same volume, i.e. the same number of elements, are 
homeomorphic. This means that they describe the same configuration. Configurations and 
entropies are therefore in bijection with discrete groups, and this removes the degeneracy. 
Different entropy = different occupation volume = different volume of the symmetry group; in 
practice this means that we have a different configuration. The most entropic configurations 
are the "maximally symmetric" ones, i.e. those that look like spheres in the above sense. 

2.2 Entropy of spheres 

In order to compute the entropy of a sphere, we proceed as follows. Let us consider distribut- 
ing the N energy attributes along a p-sphere of radius m, m < M; our problem is then to 
find out what are the most entropic ways of occupy N of the ~ m v cells of the sphere 0. For 
any dimension, the most symmetric configuration is of course the one in which one fulfils the 
volume, i.e. N ~ mP . However, we are bound to the constraint N < m for any coordinate, 
otherwise we loose the interpretation at the ground of the whole construction, namely of 
N as the coordinate of attributes, and m as the target of the assignment. N ~ mP means 
m ~ \^N, which implies m < N . The highest entropy we can attain is therefore obtained 
with the largest possible value of N as compared to m, i.e. N — m, where once again the 
equality is intended up to an appropriate, p-dependent coefficient. Let us start by consider- 
ing the entropy of a three-sphere. The weight in the phase space will be given by the number 
of times such a sphere can be formed by moving along the symmetries of its geometry, times 
the number of choices of the position of, say, its centre, in the whole space. Since we even- 
tually are going to take the limit V — > oo, we don't consider here this second contribution, 
which is going to produce an infinite factor, equal for each kind of geometry, for any finite 
amount of total energy N. We will therefore concentrate here on the first contribution, the 
one that from three-sphere and other geometries. To this purpose, we solve the "differential 
equation" (more properly, a finite difference equation) of the increase in the combinatoric 
when passing from m to m + 1. Owing to the multiplicative structure of the phase space 
(composition of probabilities), expanding by one unit the radius, or equivalently the scale of 
all the coordinates, means that we add to the possibilities to form the configuration for any 
dimension of the sphere some more ~ m + 1 times (that we can also approximate with m, 
because we work at large m) the probability of one cell times the weight of the configuration 
of the remaining m (respectively m — 1) cells. But this is not all the story: since distributing 

5 For simplicity we neglect numerical coefficients, because we are interested here in the scaling, for large 
iV and to. This is also the reason why we use the terminology of the geometry on the continuum. 
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TV energy cells along a volume scaling as ~ m 3 , m > N means that our distribution does 
not fulfill the space, the actual symmetry group of the distribution will be a subgroup of 
the whole group of the pure "geometric" symmetry: moving along this space by an amount 
of space shorter than the distance between cells occupied by an energy unit will not be a 
symmetry, because one moves to a "hole" of energy. It is easy to realize that in such a 
"sparse" space, the effective symmetry group will have a volume that stays to the volume of 
a fulfilled space in the same ratio as the respective energy densities. Taking into account all 
these effects, we obtain the following scaling: 

W{m + 1) 3 ~ W(m) 3 x (m + l) 3 x — x — . (2.2) 

The last factor expresses the density of a circle, whereas the factor is the density of the 
three-sphere. In order to make the origin of the various terms more clear, in these expressions 
we did not use explicitly the fact that actually N is going to be eventually identified with m. 
Indeed, in 12.21 there should be one more factor: when we pass from radius mtom + 1 while 
keeping N fixed, the configuration becomes less dense, and we loose a symmetry factor of 
the order of the ratio of the two densities: [m/(m + l)] 3 ~ l + 0(l/m). Expanding W(m+1) 
on the left hand side of 12.21 as W(m) + AW(m), and neglecting on the r.h.s. corrections of 
order 1/m, we can write it as: 

7777 v ^ m. (2.3) 
W(m) 3 

Since we are interested in the behaviour at large m, we can approximate it with a continuous 
variable, m — > x, x, and approximate the finite difference equation with a differential one. 
Upon integration, we obtain: 

S 3 oc \nW(m) 3 ~\rn 2 , (2.4) 

where it is intended that N = m. Without this identification, the factor (m/N) in 12.21 would 
not be the density of a 1-sphere. Under this condition, the energy density of the three-sphere 
scales as 1/N 2 , and we obtain an equivalence between energy density and curvature R: 

Ps(N) ~ £* 1 ~ R {3) . (2.5) 

This is basically the Einstein's equation relating the curvature of space to the tensor ex- 
pressing the energy density. Indeed, here this relation can be assumed to be the physical 
description of a sphere in three dimension. We can certainly think to formally distribute 
the N energy units along any kind of space with any kind of geometry, but what makes 
a curved space physically distinguishable from a flat one, and a particular geometry from 
another one? Geometries are characterized by the curvature, but how does one observer 
measure the curvature? The coordinates m of the target space have no meaning without 
energy units distributed along them. The geometry is decided by the way we assign the 
N occupation positions. Here therefore we assume that measuring the curvature of space 
is nothing else than measuring the energy density. For the time being, let us just take the 
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equivalence between energy density and curvature as purely formal; we will see in the next 
sections that this, with our definition of energy, will also imply that physical particles move 
along geodesies of the so characterized space, precisely as one expects from the Einstein's 
equations. We will come back to these issues in section |5j In a generic dimension p > 2 the 
condition for having the geometry of a sphere reads fl 



mP rrr 



In dimension p > 3 it is solved by: 

m ~ < N , p>3. (2.7) 

In two dimensions, 12.61 implies N — 1 (up to some numerical coefficient). This means that, 
although it is technically possible to distribute N > 1 energy units along a two-sphere of 
radius m > 1, from a physical point of view these configurations do not describe a sphere. 
This may sound strange, because we can think about a huge number of spheric surfaces 
existing in our physical world, and therefore we may have the impression that attempting 
to give a characterization of the physical world in the way we are here doing already fails 
in this simple case. The point is that all the two spheres of our physical experience do not 
exist as two-dimensional spaces alone, but only as embedded in a three-dimensional physical 
space, i.e. as subspaces of a three-dimensional space. In dimensions higher than three, the 
equivalent of 12. 21 reads: 

N m 

W(m + 1)„ ~ W(m) p x (m + l) p x — x— . (2.8) 

mP N 

The last term on the r.h.s. is actually one, because it was only formally written as Njm to 
keep trace of the origin of the various terms. Indeed, it indicates the density of a fulfilling 
space, to which the scaling of the weight of any dimension must be normalized. Inserting 
the condition for the p-sphere, equation 12.61 we obtain: 

W (m + 1)_ ~ W (m) p x (m + l) p x At , (2.9) 

777/ 



which leads to the following finite difference equation: 

777 p - 2 . (2.10) 



AW(m) p 2 



W(m) p 

This expression obviously reduces to 12.31 for p = 3. Proceeding as before, by transforming 
the finite difference equation into a differential one, and integrating, we obtain: 

S (p>2 ) oc hxWim) -m p - 1 , p>3. (2.11) 

p — 1 



6 We recall that we omit here p- dependent numerical coefficients which characterize the specific normal- 
ization of the curvature of a sphere in p dimensions, because we are interested in the scaling at generic JV, 
and m, in particular in the scaling at large N. 



8 



This is the typical scaling law of the entropy of a p-dimensional black hole (see for instance 
[6]). For p = 2, if we start from 12.8] without imposing the condition 12.61 of the sphere, we 
obtain, upon integration: 

S {2 ) ~ N\ (2.12) 

formally equivalent to the entropy of a sphere in three dimensions. However, the fact that 
the condition of the sphere 12.61 implies N = 1 means that a homogeneous distribution of the 
N energy units corresponds to a staple of N two-spheres. Indeed, if we use 12.91 and 12.101 f° r 
which the condition N = 1 is intended, we obtain: 

5 (2) ~ m. (2.13) 

For a radius m — N, this gives 1/N of the result I2.12[ confirming the interpretation of this 
space as the superposition of N spheres. From a physical point of view, we have therefore N 
times the repetition of the same space, whose true entropy is not iV 2 but simply N. As we 
will see in the next sections, such a kind of geometries correspond to what we will interpret 
as quantum corrections to the geometry of the universe. In the case of p = 1, from a purely 
formal point of view the condition of the sphere [2761 would imply N = 1/m. Inserted in 12.81 
and integrated as before, it gives: 

oc \nW(m) ~ mm, p=l. (2.14) 

Indeed, in the case of the one-sphere, i.e. the circle, one does not speak of Riemann curvature, 
proportional to 1/r 2 , but simply of inverse of the radius of curvature, 1/r. It is on the other 
hand clear that the most entropic configuration of the one- dimensional space is obtained by 
a complete fulfilling of space with energy units, N = m, and that the weight in the phase 
space of this configuration is simply: 

W(N)i ~ N, (2.15) 

in agreement with l2.14fl For the spheres in higher dimension, from expression 12.111 and 12 . 7W c 
derive: 

S(p>3)\n ^—m^ l —N&. (2.16) 

p — 1 p — 1 

For large p the weights tend therefore to a p-independent value: 

W(N) P ^ « e^ , (2.17) 

and their ratios tend to a constant. As a function of they are exponentially suppressed as 
compared to the three-dimensional sphere. The scaling of the effective entropy as a function 
of N allows us to conclude that: 

• At any energy N, the most entropic configuration is the one corresponding to the 
geometry of a three-sphere. Its relative entropy scales as S ~ A^ 2 . 

Spheres in different dimension have an unfavoured ratio entropy /energy. Three dimen- 
sions are then statistically "selected out" as the dominant space dimensionality. 



We always factor out the group of permutations, which brings a volume factor AM common to any 
configuration of N energy cells. 
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2.3 The "time" ordering 



A property of {tf(iV)} is that, if TVi < N 2 Vtf(iVi) G {*(^i)} 3*'(iV 2 ) G {^(^2)} 
such that ^'(A^) 2 ^(Ai), something that, with an abuse of language, we write as: 
{^(AT 2 )} D {^(Ai)}, V Ai < A 2 . It is therefore natural to introduce an ordering in the 
whole phase space, that we call a "time-ordering", through the identification of A with the 
time coordinate: A <-> t. We call "history of the Universe" the "path" A -> {$(N)}U- This 
ordering turns out to quite naturally correspond to our everyday concept of time-ordering. 
In our normal experience, the reason why we perceive a history basically consisting in a 
progress toward increasing time lies on the fact that higher times bear the "memory" of 
the past, lower times. The opposite is not true, because "future" configurations are not 
contained in those at lower, i.e. earlier, times. But in order to be able to say that an event 
B is the follow up of A, A ^ B (time flow from A — > B), at the time we observe B we 
need to also know A. This precisely means A E {&(Na)} and A e {$f(N B )}, which implies 
{^>(Na)} C {^(Nb)} in the sense we specified above. Time reversal is not a symmetry of 
the system^]. 

2.4 How do inhomogeneities arise 

In this set-up configurations are basically identified by their symmetry group. Configurations 
that describe the same geometry, but are "rotated" with respect to each other, as compared 
to an external reference frame, actually describe the same configuration. The reason is that 
there is no "external frame" : reference points are defined through the intrinsic asymmetries 
of the configurations in themselves. Reference points are introduced through asymmetries. 
Starting from the most entropic one, we can think to progressively obtain all the less entropic 
configurations by "moving" away the more and more units of energy to form less and less 
symmetric configurations, also walking through different dimensions. In this way, one obtains 
a tower of asymmetric configurations "stapled" on the point at which the first asymmetry has 
been introduced. The superposition of configurations does not produce therefore a uniform 
universe, but a kind of "spontaneous" breaking of any symmetry. From the property, stated 
at page El that at any time T ~ N there do not exist two inequivalent configurations with 
the same entropy, and from the fact that less entropic configurations possess a lower degree 
of symmetry, we obtain that: 

• At any time T the average appearance of the universe is that of a space in which all 
symmetries are broken. 

The amount of breaking, depending on the weight of non-symmetric configurations as 
compared to the maximally symmetric one, involves a relation between the energy (i.e. the 
deformations of the geometry) and the time spread/space length, of the deformation, that 
will be discussed in the next sections. The inhomogeneities produced in this way give rise 
to the varied spectrum of mass and energy clusters, galaxies, particles, etc... As there is no 

8 Notice that {4 f (iV)}, the "phase space at time N" , includes also tachyonic configurations. 

9 Only by restricting to some subsets of physical phenomena one can approximate the description with 
a model symmetric under reversal of the time coordinate, at the price of neglecting what happens to the 
environment. 
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external frame, in this framework there is also no external observer: an observer is a "local 
inhomogeneity" of space, and necessarily belongs to the universe. The observer is only 
sensitive to its own configuration, in the sense that he "learns" about the full space only 
through the superposition of configurations he is made of, and their changes. For instance, 
he can perceive that the configurations of space of which he is built up change with time, 
and interprets these changes as due to the interaction with an environment. 

2.5 Mean values and observables 

At any time T ~ N in the "universe" given by (\&(./V)} the mean value of any observable 
quantity O is the sum of the contributions to O over all configurations weighted according 
to their volume of occupation in the phase space: 

< O > oc W i^) °(*) • ( 2 - 18 ) 
*(T) 

We have written the symbol oc instead of = because, as it is, the sum on the r.h.s. is not 
normalized. The weights don't sum up to 1, and not even do they sum up to a finite number: 
in the infinite volume limit, they all diverge p. However, as we discussed in section I2TT1 what 
matters is their relative ratio, which is finite because the infinite volume factor is factored 
out. In order to normalize mean values, we introduce a functional that works as "partition 
function", or "generating function" of the Universe: 

z d ^j2 w w = xy w - (2.i9) 

*(T) *(T) 

The sum has to be intended as always performed at finite volume. In order to define mean 
values and observables, we must in fact always think in terms of finite space volume, a 
regularization condition to be eventually relaxed. The mean value of an observable can then 
be written as: 

<0>= (2.20) 
*(T) 

Mean values therefore are not defined in an absolute way, but through an averaging procedure 
in which the weight is normalized to the total weight of all the configurations, at any finite 
space volume V. 

2.6 Summing up geometries 

We may now ask what a "universe" given by the collection of all configurations at a given 
time N looks like to an observer. Indeed, a physical observer will be part of the universe, and 
as such correspond to a set of configurations that identify a preferred point, something less 

10 As long as the volume, i.e. the total number of cells of the target space, for any dimension, is finite, 
there is only a finite number of ways one can distribute energy units. In the infinite volume limit, both the 
number of possibilities for the assignment of energy, and the number of possible dimensions, become infinite. 
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symmetric and homogeneous than a sphere. However, let us just assume that the observer 
looks at the universe from the point of view of the most entropic configuration, namely it 
lives in three dimensions, and interprets the contribution of any configuration in terms of 
three dimensions. This means that he will not perceive the universe as a superposition of 
spaces with different dimensionality, but will measure quantities, such as for instance energy 
densities, referring them to properties of the three dimensional space, although the contri- 
bution to the amount of energy may come also from configurations of different dimension 
(higher or lower than three). 

From this point of view, let us see how the contribution to the average energy density 
of space of all configurations which are not the three-sphere is perceived. In other words, 
we must see how do the p 7^ 3 configurations project onto three dimensions. The average 
density should be given by: 



We will first consider the contribution of spheres. To the purpose, it is useful to keep in 
mind that at fixed N (i.e. fixed time) higher dimensional spheres become the more and 
more "concentrated" around the (higher-dimensional) origin, and the weights tend to a p- 
independent value for large p (see 12.71 and 12.171) . When referred to three dimensions, the 
energy density of a p sphere, p > 3, is 1/N P ~ 1 , so that, when integrated over the volume, 
which scales as ~ N p , it gives a total energy ~ N. There is however an extra factor N 3 /N p 
due to the fact that we have to re-normalize volumes to spread all the higher- dimensional 
energy distribution along a three-dimensional space. All in all, this gives a factor l/N 2(jl ~ 2 ^ 
in front of the intrinsic weight of the p-spheres. Since the latter depend in a complicated 
exponential form on P and N, it is not possible to obtain an expression of the mean value of 
the energy distribution in closed form. However, as long as we are interested in just giving 
an approximate estimate, we can make several simplifications. A first thing to consider 
is that, as we already remarked, at finite N, the number of possible dimensions is finite, 
because it does not make sense to distribute less than one unit of energy along a dimension: 
as a matter of fact such a space would not possess this dimension. Therefore, p < N. In 
the physically relevant cases N ^> 1, and we have anyway a sum over a huge number of 
terms, so that we can approximate all the weights but the three dimensional one by their 
asymptotic value, W ~ exp N. This considerably simplifies our computation, because with 
these approximations we have: 



that, in the further approximation that expiV 2 ^> iVexpiV, so that expiV 2 + N exp N m 
expiV 2 , we can write as: 



(P(E)) 



W(*(N))p(E) 9{N) 
Z mN) W(*(N)) 





(2.22) 



(P(E) N ) 



1 



1 



1 

iV 2 



+ O {e' N ) . 



(2.23) 
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We consider now the contribution of configurations different from the spheres. Let us first 
concentrate on the dimension D = 3, which is the most relevant one. The simplest defor- 
mation of a 3-sphere consists in moving just one energy unit one step away from its position 
on the sphere. Owing to this move, we break part of the symmetry. Further breaking is 
produced by moving more units of energy, and by larger displacements. Our problem is to 
estimate the amount of reduction of the weight as compared to the sphere. Let us consider 
displacing just one unit of energy. We can consider that the overall symmetry group of 
the sphere is so distributed that the local contribution is proportional to the density of the 
sphere, 1/N 2 . Displacing one unit energy cell should then reduce the overall weight by a 
factor ~ (1 — 1/N 2 ). Displacing the unit by two steps would lead to a further suppression 
of order 1/N 2 . Displacing more units may lead to partial symmetry restoration among the 
displaced cells. Even in the presence of partial symmetry restorations the suppression factor 
due to the displacement of n units remains of order « n 2 /N 2n (the suppression factor di- 
vided by the density of a sphere made of n units) as long as n <C N. The maximal effective 
value n can attain in the presence of maximal symmetry among the displaced points is of 
course N/2, beyond which we fall onto already considered configurations. This means that 
summing up all the contributions leads to a correction which is of the order of the sum of 
an (almost) geometric series of ratio 1/N 2 . Similar arguments can be applied to D ^ 3, to 
conclude that expression 12.231 receives all in all a correction of order 1/N 2 . This result is 
remarkable. As we will discuss in the following along this paper, the main contribution to the 
geometry of the universe, the one given by the most entropic configuration, can be viewed as 
the classical, purely geometrical contribution, whereas those given by the other, less entropic 
geometries, can be considered contributions to the "quantum geometry" of the universe 
From 12.231 we see that not only the three-dimensional term dominates over all other ones, 
but that it is reasonable to assume that the universe looks mostly like three-dimensional, 
indeed mostly like a three-sphere. This property becomes stronger and stronger as time goes 
by (increasing N). From the fact that the maximal entropy is the one of three spheres, and 
scales as Sr^ ~ N 2 , we derive also that the ratio of the overall weight of the configurations 
at time N — 1, normalized to the weight at time N, is of the order: 

W(N-l) » W(N)e~ 2N . (2.24) 

At any time, the contribution of past times is therefore negligible as compared to the one of 
the configurations at the actual time. The suppression factor is such that the entire set of 
three-spheres at past times sums up to a weight of the order of W(N — 1): 

AT-l 

We want to estimate now the overall contribution to the partition function due to all the 
configurations, as compared to the one of the configuration of maximal entropy. We can 



11 In Ref. [5] we discuss how the classical part of the curvature can be referred to the cosmological constant, 
while the other terms to the contribution due to matter and radiation. In particular, we recover the basic 
equivalence of the order of magnitude of these contributions, as the consequence of a non-completcly broken 
symmetry of the quantum theory which is going to represent our combinatorial construction in terms of 
quantum fields and particles. 
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view the whole spectrum of configurations as obtained by moving energy units, and thereby 
deforming parts of the symmetry, starting from the most symmetric (and entropic) configu- 
ration. In this way, not only we cover all possible configurations in three dimensions, but we 
can also walk through dimensions. In order to account for the contribution to the partition 
function of all the deformations of the most entropic geometry, we can think of a series of 
steps, in which we move from the spheric geometry one, two, three, and so on, units of 
symmetry. At large N, we can approximate sums with integrals, and account for the contri- 
bution to the "partition function" 12.191 of all the configurations by integrating over all the 
possible values of entropy, decreasing from the maximal one. In the approximation of vari- 
ables on the continuum, symmetry groups are promoted to Lie groups, and moving positions 
and degrees of freedom is a "point-wise" operation that can be viewed as taking place on 
the algebra, not on the group elements. Therefore, the measure of the integral is such that 
we sum over incremental steps on the exponent, that is on the logarithm of the weight, the 
entropy. Therefore, we can write the sum over weights as a sum over the decrements from 
the highest entropy, the one of the three-sphere, given in 12.11} namely S max = 5*0= exp TV 2 : 

Z > dLe So{1 - L) . (2.26) 

~ Jo 

This has to be taken as an approximate way of accounting for the order of magnitude of the 
contribution of the infinity of configurations. Integrating I2.26[ we obtain: 

Z « e S() M + i- V (2.27) 

The result would however not change if, instead of considering the integration on just one 
degree of freedom, parametrized by one coordinate, L, we would integrate over a huge (infi- 
nite) number of variables, each one contributing independently to the reduction of entropy, 
as in: 

- f 

Z « e 5o[1 - (Ll+ - +i " )] , (2.28) 

71=1 "* 

In the second case, I2.28[ we would have: 

Z - e^±^e-(l + ^), ,2.29) 

anyway of the same order as 12.271 Together with 12.251 this tells us also that instead of 11.11 
we could as well define the partition function of the universe at "time" £ by the sum over 
all the configurations at past time/energy E up to £: 

z £ = eSW - ( 2 - 3 °) 

if){E<£) 
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3 The Uncertainty Principle 



According to 12.201 quantities which are observable by an observer living in three dimensions 
do not receive contribution only from the configurations of extremal or near to extremal 
entropy: all the possible configurations at a certain time contribute. Their value bears 
therefore a "built-in" uncertainty, due to the fact that, beyond a certain approximation, 
experiments in themselves cannot be defined as physical quantities of a three-dimensional 
world. In section 12.31 we have established the correspondence between the "energy" N and 
the "time" coordinate that orders the history of our "universe" . Although we gave N the 
interpretation of total energy of a configuration, and as such it determines its geometry, 
this does not mean that N is also the total energy of the universe, as it is measured by an 
observer, necessarily belonging to the universe. In other words, N does not coincide with the 
operational way we define energy, related to the way we measure it. Indeed, as it is, N simply 
reflects the "time" coordinate, and states the age of the universe. The energy one can measure 
is an average quantity defined as in 12.201 In principle, since E = N for any configuration, 
the two quantities seem to coincide, but as a matter of fact they don't: re-summing E = N 
from 12 . 201 implies a knowledge of all the configurations contained in {\l/(A r )}. The universe is 
the result of a superposition in which also very singular configurations contribute, in general 
uninterpretable within the usual conceptual framework of particles, or wave-packets, and 
in general of geometries of a three-dimensional space. Therefore, not directly accessible 
to a three-dimensional observer. When we measure an energy, or equivalently a "geometric 
curvature", we refer therefore to an average and approximated concept, for which we consider 
only a subset of all the configurations of the universe. Now, we have seen that the larger is 
the "time" N, the higher is the dominance of the most probable configuration over the other 
ones, and therefore more picked is the average, the "mean value" of geometry. The error 
in the evaluation of the energy content will therefore be the more reduced, the larger is the 
time spread one considers, because relatively lower becomes the weight of the configurations 
one ignores. From 12.291 we can have an idea of what is the order of the uncertainty in 
the evaluation of energy. According to 12.281 and 12.291 the mean value of the total energy, 
receiving contribution also from all the other configurations, results to be "smeared" by an 
amount: 

<E> « E So + E So x 0(1/ So) ■ (3.1) 
That means, inserting So ~ N 2 = t 2 ~ E s : 

<E> « E So + AE So « E So + O (jj . (3.2) 

Consider a subregion of the universe, of extension At 0. Whatever exists in it, namely, 
whatever differentiates this region from the uniform spherical ground geometry of the uni- 
verse, must correspond to a superposition of configurations of non-maximal entropy. From 
our considerations of above, we can derive that it is not possible to know the energy of this 
subregion with an uncertainty lower than the inverse of its extension. In order to see this, 

12 We didn't yet introduce units distinguishing between space and time. In the usual language we could 
consider this region as being of "light-extension" Ax = cAt. 
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let's estimate what is the amount of the contribution to this energy given by the sea of 
configurations of non-maximal entropy. As discussed, these include higher and lower space 
dimensionalities, and any other kind of differently interpretable combinatorics. The mean 
energy will be given as in 13.11 However, this time the maximal entropy So (At) of this sub- 
system will be lower than the upper bound constituted by the maximal possible entropy of 
a region enclosed in a time At, namely the one of a three-sphere of radius At: 

So(At) < [At] 2 , (3.3) 

and the correction corresponding to the second term in the r.h.s. of 13.21 will just constitute 
a lower bound to the energy uncertainty 0: 

In other words, no region of extension At can be said with certainty to possess an energy 
lower than 1/ At. When we say that we have measured a mass/energy of a particle, we mean 
that we have measured an average fluctuation of the configuration of the universe around 
the observer, during a certain time interval. This measurement is basically a process that 
takes place along the time coordinate. During the time of the "experiment" , At, a small 
"universe" of superposing configurations opens up for this particle. Namely, what we are 
probing are the configurations of a space region created in a time At, in which the highest 
entropy is the one of a sphere, this time of radius AN = At, etc... According to 13.41 the 
particle possesses therefore a "ground" indeterminacy in its energy: 

AE At > 1 . (3.5) 

As a bound, this looks quite like the time-energy Heisenberg uncertainty relation 0- In the 
case we consider the whole universe itself, expression 12.291 tells us that the terms neglected 
in the partition function, due to our ignorance of the "sea" of all the possible configurations 
at any fixed time, contribute to an "uncertainty" in the total energy of the same order as 
the inverse of the age of the universe: 

A£ tot ~ O (±Y (3.6) 

Namely, an uncertainty of the same order as the imprecision due to the bound on the size of 
the minimal energy steps at time T. The quantity 1/So ~ 1/T 2 basically corresponds to the 
parameter usually called "cosmological constant" , that in this scenario is not constant 0. 
The cosmological constant therefore corresponds to a bound on the effective precision of 



13 The maximal energy can be E ~ At even for a class of non-maximal-entropy, non-spheric configurations. 
14 Introducing the Planck constant is here just a matter of introducing units enabling to measure energies 
in terms of time. 

15 The approximate value of the cosmological constant is usually computed to be A ~ 1/H§, where Ho is 
the Hubble parameter, whose value corresponds, under appropriate conversion of units, to the present age 
of the universe. 
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calculation of the predictions of this theoretical scenario. 13.51 and 13.61 tell us that theoret- 
ical and experimental uncertainties are of the same order. The bound to an experimental 
access to the universe as we know it corresponds to the limit within which such a uni- 
verse is in itself defined. Beyond this threshold, there ' of configurations in which 
i) the dimensionality of space is not fixed; ii) interactions are not defined, iii) there are 
tachyonic contributions, causality does not exist etc... beyond this threshold there is a sea 
of. . .uninterpretable combinatorics. 

• It is not possible to go beyond the Uncertainty Principle 's bound with the precision in 
the measurements, because this bound corresponds to the precision with which the quantities 
to be measured themselves are defined. 

4 Deterministic or probabilistic physics? 

The sum 11.11 implies a scenario which is neither probabilistic in the usual sense of quantum 
mechanics, nor deterministic according to the usual meaning of causality. It is rather " de- 
termined" by the partition function at any time. The universe at time N r ~ T' — T + ST ~ 
N + 1 is not obtained by running forward, possibly through equations of motion, the config- 
urations at time N ~ 7", it is not their "continuation" : it is given by the weighted sum of all 
the configurations at time T + 5T, as the universe at time T was given by the weighted sum 
of all the configurations at time T. In the large N limit, we can speak of "continuous time 
evolution" only in the sense that for a small change of time, the dominant configurations 
correspond to distributions of geometries that don't differ that much from those at previous 
time. With a certain approximation we can therefore speak of evolution in the ordinary 
sense of (differential, or difference) time equations. Owing to the fact that at any time the 
appearance of the universe is mostly determined by the most entropic configurations, in the 
average 

• the dynamics of the evolution of the system is of entropic type. 

On the other hand, a full knowledge of the infinite terms of 11.11 is impossible, and, owing 
to the fact that configurations in any dimensions are accounted, also ill-defined. From this 
point of view, the probabilistic interpretation of the Heisenberg Uncertainty given in quantum 
mechanics seems a viable way of parametrizing the unknown, reintroducing thereby a certain 
degree of predictability and calculability. This is also the case of systems in which the 
asymmetries are "hidden" below the threshold of the uncertainty I3.5[ and produce therefore 
the impression of equal probability of equivalent situations, like the two possible paths of 
an electron in the double slit experiment: being able to predict the details of an event, 
such as for instance the precise position each electron will hit on the plate, and in which 
sequence, requires to know the function "entropy" for an infinite number of configurations, 
corresponding to any space dimensionality at fixed T ~ N, for any time T the experiment 
runs on. Clearly, no computer or human being can do that. If on the other hand we content 
ourselves with an approximate predictive power, we can roughly reduce physical situations 
to certain ideal schemes, such as for instance "the symmetric double slit" problem. Of 
course, from a theoretical point of view we lose the possibility of predicting the position the 
first electron will hit the target (something anyway practically impossible to do), but we 
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gain, at the price of introducing symmetries and therefore also concepts like "probability 
amplitudes" , the capability of predicting with a good degree of precision the shape an entire 
beam of electrons will draw on the plate. We give up with the "shortest scale", and we 
concern ourselves only with an "intermediate scale", larger than the point-like one, shorter 
than the full history of the universe itself. The interference pattern arises as the dominant 
mean configuration, as seen through the rough lens of this "intermediate" scale. In this 
scenario, quantum de-coherence is "built-in" in 11.11 

5 Relativity 

As we discussed in sections I2.2H2.6[ although the volume of the target spaces of the maps 
ty(N) is eventually to be considered infinite, V — > oo, at any finite time the dominant con- 
figuration of the universe corresponds to a three-sphere of radius N ~ T. On the top of this 
staple many "almost spherical", three-dimensional configurations that, in the superposition, 
give rise to a space with energy clusters. But in the sum 12.191 there are also configurations 
which correspond to a geometry not bounded within a region of radius N ~ T, nor three- 
dimensional. Indeed, for any V, there are configurations which "fulfill" the volume. They 
contribute in the form of quantum perturbations, all of them falling under the "cover" of the 
Uncertainty Principle, and being therefore related to what we interpret as the quantum na- 
ture of physical phenomena. This can be interpreted in the following way: at any finite time 
T we have a universe which is infinitely extended, but that can be organized by separating 
it into a "classical part" , with a geometry looking like the interior of a black hole, with a 
horizon placed at distance oc T, and a quantum part, which accounts for the contribution 
of any other kind of configurations. Only the classical part can be reduced to the ordinary 
geometric interpretation of space extended only up to a distance oc T. In this perspective, 

• the space "outside" the horizon is infinitely extended, but it contributes to the perception 
of a classical observer and to the values of the observables defined in the three-dimensional 
classical space only through the uncertainty of mean values, accounted for by the Heisenberg's 
uncertainty. 

In the following we want to see how in this universe Einstein's special (and general) relativity 
are implied as a particular limit, in which one considers just the classical part of space. 

5. 1 From the speed of expansion of the universe to a maximal speed for the propagation of 
information 

The classical space corresponds to a universe of radius ~ N at time N, with total energy 
also N. It expands at speed 1. Indeed, we can introduce a factor of conversion from time to 
space, c, and say that, by choice of units, we set the speed of expansion to be c = 1 (in an 
obvious way, also the conversion between units of space, and time, on one side, and energy 
on the other side, is here "by default" set to one, but it can be called h). We want to see 
how this is also the maximal speed for the propagation of information within the classical 
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space. It is important to stress that all this refers only to the classical space as we have 
defined it, because only in this sense we can say that the universe is three dimensional: the 
sum 12 . 1 91 contains in fact also configurations that, through the time flow, can be interpreted 
as "tachyonic" , along with configurations in which it is not even clear what is the meaning of 
speed of propagating information in itself, as there is no recognizable information at all, at 
least in the sense we usually intend it. Indeed, when we say we get information about, say, 
the motion of a particle, or a photon, we intend to speak of a non-dispersive wave packet, so 
that we can say we observe a particle, or photon, that remains particle, or photon, along its 
motion 0. The existence, in the scenario implied by 12.191 of structures of this kind, namely 
of wave packets that behave like massive particles, or massless photons etc., is confirmed 
by the analysis performed in [2]. Let's consider the simplified case of a universe at time N 
containing only one such a wave packet 0, as illustrated in figure [H where it is represented by 
the shadowed cells, and the space is reduced to two dimensions. Consider now the evolution 



Figure 1: 



Figure 2: 



Figure 3: 



at the subsequent instant of time, namely after having progressed by a unit of time. Adding 
one point, N — > N + 1, does produce an average geometry of a three sphere of radius N + l 
instead of N. In the average, it is therefore like having added 4ttN 2 "points" , or unit cells. 
Remember that we work always with an infinite number of cells in an unspecified number 
of dimensions; when we talk of universe in three dimensions within a region of a certain 
radius, we just talk of the dominant geometry. Let's suppose the position of the wave packet 
jumps by steps (two cells) back, as illustrated in figure [2] Namely, as time, and consequently 
also the radius of the universe, progresses by one unit, the packet moves at higher speed, 
jumping by two units. Compare this case with the case in which the packet jumps by just 
one unit, as in figure [3) The entropy of this latter configuration, intermediate between the 
first and second one, cannot be very different from the one of the second configuration, 
figure El in which the packet jumps by two steps, because that was supposed to be the 
dominant configuration at time N+l, and therefore the one of maximal entropy. Indeed, by 
"continuity" it must interpolate between step 2 and the configuration at time N, that was 
also supposed to be a configuration of maximal entropy. Therefore, the actual appearance 



16 Like a particle, also a physical photon, or any other field, is not a pure plane wave but something 
localized, therefore a superposition of waves, a wave packet. 

17 We may think to concentrate onto only a portion of the universe, where only such a wave packet is 
present. 
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of the universe at time N + 1 must be somehow a superposition of the configurations 2 and 
3, thereby contradicting our hypothesis that the wave packet is non- dispersive B Therefore, 
the wave packet cannot jump by two steps, and we conclude that the maximal speed allowed 
is that of expansion of the radius of the universe itself, namely, c. 

It is too early here to discuss the actual existence in this scenario of degrees of freedom 
that can be interpreted as photons. In order to do this we must pass to a representation on 
the continuum, where, as discussed in [I], it corresponds to a string scenario. Here we just 
anticipate that, according to this theoretical framework, the reason why we have a universal 
bound on the speed of light is therefore that light carries what we call classical information. 
Information about whatever kind of event tells about a change of average entropy of the 
observed system, of the observer, and what surrounds and connects them too. The rate of 
transfer/propagation of information is therefore strictly related to the rate of variation of 
entropy. Variation of entropy is what gives the measure of time progress in the universe. 
Any carrier of information that "jumps" steps of the evolution of the universe, going faster 
than its rate of entropy variation, becomes therefore dispersive, looses information during its 
propagation. Light must therefore propagate at most at the rate of expansion of space-time 
(i.e. of the universe itself). Namely, at the rate of the space/time conversion, c. 

5.2 The Lorentz boost 

Let's now consider physical systems that can be identified as "massive particles", i.e. let 
us assume that there are local superpositions of configurations which are interpreted as 
travelling at speeds always lower than c. Since the phase space has a multiplicative structure, 
and entropy is the logarithm of the volume of occupation in this space, it is possible to 
separate for each such a system the entropy into the sum of an internal, "rest" entropy, and 
an external, "kinetic" entropy. The first one refers to the structure of the system in itself, 
that can be a particle or an entire laboratory. A point-like particle is an extended object of 
which we neglect the geometric structure. The second one refers to the relation/interaction 
of this system with the environment, the external world: its motion, the accelerations and 
external forces it experiences, etc. 

Let us for a moment abstract from the fact that the actual configuration of the universe 
implied by 12.191 at any time describes a curved space. In other words, let's neglect the so 
called "cosmological term" . This approximation can make sense at large N, as is the case of 
the present-day physics. This means at large age of the universe 0. Let us also assume we 
can just focus our attention on two observers sitting on two inertial frames, A and A', moving 
at relative speed v, neglecting everything else. For what said above, v < 1. An experiment 
is the measurement of some event that, owing to the fact that happening of something 

18 If it was dispersive, it would be something like a particle that, during its motion, "dissolves", and 
therefore we cannot anymore trace as a particle. It would be just a "vacuum fluctuation" without true 
motion, something that does not carry any information in the classical sense. 

19 To make contact with ordinary physics, consider that, once expressed in units in which the Planck 
constant and the speed of light are 1, the present age of the universe is estimated to be of order 10 31 , and the 
cosmological constant of order A ~ 10~ 61 . It is precisely its smallness what historically allowed to introduce 
special relativity and Lorentz boosts before addressing the problem of the cosmological constant. 
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means changing of entropy and therefore is equivalent to a time progress, is perceived as 
having taken place during a certain interval of time. Let us consider an experiment, i.e. the 
detection of some event, taking place in the co- moving frame of A', as reported by both the 
observer at rest in A, and the one at rest in A' (from now on we will indicate with A, and A', 
indifferently the frame as well as the respective observer). Let's assume we can neglect the 
space distance separating the two observers, or suppose there is no distance between them 
□j. For what we said above, such a detection amounts in observing the increase of entropy 
corresponding to the occurring of the event, as seen from A, and from A' itself. Since we are 
talking of the same event, the overall change of entropy will be the same for both A and A'. 
One would think there is an "absolute" time interval, related to the evolution of the universe 
corresponding to the change of entropy due to the event under consideration. However, the 
story is rather different as soon as we consider time measurements of this event, as reported 
by the two observers, A and A'. The reason is that the two observers will in general attribute 
in a different way what amount of entropy change has to be considered a change of entropy 
of the "internal" system, and which amount refers to an "external" change. Proper time 
measurements have to do with the internal change of entropy. For instance, consider the 
entropy of all the configurations contributing to form, say, a clock. The part of phase space 
describing the uniform motion of this clock will not be taken into account by an observer 
moving together with the clock, as it will not even be measurable. This part will however be 
considered by the other observer. Therefore, when reporting measurements of time intervals 
made by two clocks, one co-moving with A, and one seen by A to be at rest in A', owing 
to a different way of attributing elements within the configurations building up the system, 
between "internal" and "external" , we will have in general two different time measurements. 
Let us indicate with AS* the change of entropy as is observed by A. We can write: 

AS (= AS(A)) = ASfrnternal = at rest) + AS'(external) (5.1) 



= AS(A') + AS Kinctic (A) , (5.2) 

with the identifications A5'(internal = at rest) = AS(A') and AS'(external) = Akinetic (^4)- 
In section [2721 we discussed how the entropy of a three sphere is proportional to iV 2 = E 2 . 
This is therefore also the entropy of the average, classical universe, that in the continuum 
limit, via the identification of total energy with time, can be written as: 

S oc (cT) 2 , (5.3) 

where T is the age of the universe. This relation matches with the Hawking's expression of 
the entropy of a black hole of radius r = cT [HE]- It is not necessary to write explicitly 
the proportionality constant in (15.31) . because we are eventually interested only in ratios of 
entropies. During the time of an event, At, the age of the universe passes from T to T+ At, 
and the variation of entropy, AS = S(T + At) — S(T), is: 

AS oc (cAt) 2 + c 2 T 2 (^j . (5.4) 



-'In our scenario, huge (=cosmic) distances have effect on the measurement of masses and couplings. 
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The first term corresponds to the entropy of a "small universe", the universe which is "cre- 
ated", or "opens up" around an observer during the time of the experiment, and embraces 
within its horizon the entire causal region about the event. The second term is a "cosmo- 
logical" term, that couples the local physics to the history of the universe. The influence 
of this part of the universe does not manifest itself through elementary, classical causality 
relations within the duration of the event, but indirectly, through a (slow) time variation 
of physical parameters such as masses and couplings, (we refer to [2] for a discussion of the 
time dependence of masses and couplings. See also [9]). In the approximation of our abstrac- 
tion to the rather ideal case of two inertial frames, we must neglect this part, concentrating 
the discussion to the local physics. In this case, each experiment must be considered as a 
"universe" in itself. Let's indicate with At the time interval as reported by A, and with At' 
the time interval reported by A'. In units for which c = 1, and omitting the normalization 
constant common to all the expressions like 15.31 , we can therefore write: 

AS{A) — >■ (AS{A)) « (At) 2 , (5.5) 

whereas 

AS (A') ->• (AS(A')) « (At') 2 , (5.6) 

and 

AS Kinctic (A) = (vAt) 2 . (5.7) 

These expressions have the following interpretation. As seen from A, the total increase of 
entropy corresponds to the black hole-like entropy of a sphere of radius equivalent to the time 
duration of the experiment. Since v = c = 1 is the maximal "classical" speed of propagation 
of information, all the classical information about the system is contained within the horizon 
set by the radius cAt = At. However, when A attempts to refer this time measurement to 
what A' could observe, it knows that A' perceives itself at rest, and therefore it cannot include 
in the computation of entropy also the change in configuration due to its own motion (here it 
is essential that we consider inertial systems, i.e. constant motions). 11 A n separates therefore 
its measurement into two parts, the "internal one", namely the one involving changes that 
occur in the configuration as seen at rest by A' (a typical example is for instance a muon's 
decay at rest in A'), and a part accounting for the changes in the configuration due to the 
very being A' in motion at speed v. If we subtract the internal changes, namely we think at 
the system at rest in A' as at a point without meaningful physics apart from its motion in 
space 0, the entire information about the change of entropy is contained in the "universe" 
given by the sphere enclosing the region of its displacement, v 2 (At) 2 = Akinetic (^4) • In 
other words, once subtracted the internal physics, the system behaves, from the point of view 
of A, as a universe which expands at speed v, because the only thing that happens is the 
displacement itself, of a point otherwise fixed in the local universe (see figure Hj). Inserting 
expressions I5.5H5.7I in 15.21 we obtain: 

(Atf = ^2 (5.8) 

21 No internal physics means that we also neglect the contribution to the energy, and entropy, due to the 
mass. 
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Figure 4: During a time At, the pure motion "creates" a universe with an horizon at distance 
Ax = vAt from the observer. As seen from the rest frame, this part of the physical system 
does not exist. The "classical" entropy of this region is given by the one of its dominant 
configuration, i.e. it corresponds to the entropy of a black hole of radius Ax. 



that is: 

At' , N 

At = -f==. (5.9) 



vT 



v 



The time interval as measured by A results to be longer by a factor (a/1 ~ v 2 )~ 1 than as 
measured by A'. In this argument the bound on the speed of information, and therefore of 
light, enters when we write the variation of entropy of the "local universe" as AS = (cAt) 2 . 
If c — > oo, namely, if within a finite interval of time an infinitely extended causal region opens 
up around the experiment, both A and A' turn out to have access to the full information, 
and therefore At = At'. This means that they observe the same overall variation of entropy. 



5.2.1 the space boost 

In this framework we obtain in quite a natural way the Lorentz time boost. The reason 
is that, for us, time evolution is directly related to entropy change, and we identify config- 
urations (and geometries) through their entropy. The space length is somehow a derived 
quantity, and we expect also the space boost to be a secondary relation. Indeed, it can 
be easily derived from the time boost, once lengths and their measurements are properly 
defined. However, these quantities are less fundamental, because they are related to the clas- 
sical concept of geometry. We could produce here an argument leading to the space boost. 
However, this would basically be a copy of the classical derivation within the framework of 
special relativity. The derivation of the time boost through entropy-based arguments opens 
instead new perspectives, allowing to better understand where relativity ends and quantum 
physics starts. Or, to better say, it provides us with an embedding of this problem into a 
scenario that contains both these aspects, relativity and quantization, as particular cases, to 
be dealt with as useful approximations. 



5.3 General time coordinate transformation 



Lorentz boosts are only a particular case of the general coordinate transformation, obtained 
within the context of General Relativity; in that case the measure of time lengths is given by 
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the time-time component of the metric tensor. In the absence of mixing with space boosts, 
i.e., with a diagonal metric, we have: 

{dsf = goo(dt) 2 . (5.10) 
As the metric depends on the matter/energy content through the Einstein's Equations: 

Tl^u - = SkGnT^ , (5.11) 

<7oo can be computed when we know the energy of the system. For instance, in the case of a 
particle of mass m moving at constant speed v (inertial motion), the energy, the "external" 
energy, is the kinetic energy \mv 2 , and we recover the w 2 -dependence of the Lorentz boost 

In the simple case of the previous section, we have considered the physical system of the 
wave packet as decomposed into a part experiencing an "internal" physics, and a part which 
corresponds to the point of view of the center of mass, that is a part in which the complex 
internal physics is dealt with as a point-like particle. The Lorentz boost has been derived as 
the consequence of a transformation of entropies. Indeed, our coordinate transformation is 
based on the same physical grounds as the usual transformation of General Relativity, based 
on a metric derived from the energy tensor. Let us consider the transformation from this 
point of view: although imprecise, the approach through the linear approximation helps to 
understand where things come from. In the linear approximation, where one keeps only the 
first two terms of the expansion of the square-root \/l — v 2 /c 2 , the Lorentz boost can be 
obtained from an effective action in which in the Lagrangian appear the rest and the kinetic 
energy. These terms correspond to the two terms on the r.h.s. of equation 15.21 Entropy has 
in fact the dimension of an energy multiplied by a time EE Approximately, we can write: 

AS ~ AEAt , (5.12) 

where AE is either the kinetic, or the rest energy. The linear version of the Lorentz boost 
is obtained by inserting in ( 15 . 1 2[) the expressions AE rest = m and AE^ inetic = ^mv 2 . In 
this case, the linearization of entropies lies in the fact that we consider the mass a constant, 
instead of being the full energy of the "local universe" contained in a sphere of radius At, 
i.e. the energy (mass) of a black hole of radius At: m = AE = At/2. In our theoretical 
framework, the general expression of the time coordinate transformation is: 

(At') 2 = (AS'(t)) - (AS' external (t)) ■ (5.13) 

Here AS'(t) is the total variation of entropy of the "primed" system as measured in the "un- 
primed" system of coordinates: (AS"(t)) = {At) 2 . We can therefore write expression 15.131 
as: 

(At') 2 = [1 - Q{t)] (At) 2 , (5.14) 



22 In the determination of the geometry, what matters here is not the full force experienced by the particle 
but the field in which the latter moves. The mass m therefore drops out from the expressions (see for 
instance [TO]). 

23 By definition, dS = dE/T, where T is the temperature, and remember that in the conversion of ther- 
modynamic formulas, the temperature is the inverse of time. 
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where: 

Q(t) = AS '^nal(t) (515) 

(At) 

With reference to the ordinary metric tensor g^, we have: 

G(t) = l - goo . (5.16) 

AS' external (t) is the part of change of entropy of A' referred to by the observer A as something 
that does not belong to the rest frame of A'. It can be the non accelerated motion of A' , as in 
the previous example, or more generally the presence of an external force that produces an 
acceleration. Notice that the coordinate transformation [5J3] starts with a constant term, 1: 
this corresponds to the rest entropy term expressed in the frame of the observer. For the 
observer, the new time metric is always expressed in terms of a deviation from the identity. 

By construction, 15.151 is the ratio between the metric in the system which is observed 
and the metric in the system of the observer. From such a coordinate transformation we can 
pass to the metric of space-time itself, provided we consider the coordinate transformation 
between the metric g' of a point in space-time, and the metric of an observer which lies on 
a flat reference frame, whose metric is expressed in flat coordinates. We have then: 

l-S(t) = -^-^ (5.17) 

000 = ^oo = 1 

As soon as this has been clarified, we can drop out the denominator and we rename the 
primed metric as the metric tout court. 



5.4 General Relativity 

Once the measurement of lengths is properly introduced, as derived from a measurement of 
configurations along the history of the system, it is possible to extend the relations also to the 
transformation of space lengths. This gives in general the components of the metric tensor 
as functions of entropy and time. In classical terms, whenever this reduction is possible, 
this can be rephrased into a dependence on energy (energy density) and time. They give 
therefore a generalized, integrated version of the Einstein's Equations. Let's see this for 
the time component of the metric. We want to show that the metric goo of the effective 
space-time corresponds to the metric of the distribution of energy in the classical space, i.e., 
in the classical limit of effective three-dimensional space as it arises from 12.191 This will 
mean that the geometry of the motion of a particle within this space is the geometry of the 
energy distribution. In particular, if the energy is distributed according to the geometry of 
a sphere, so it will be the geometry of space-time in the sense of General Relativity. To this 
regard, we must remember that: 

i) All these arguments make only sense in the "classical limit" of our scenario, namely 
only in an average sense, where the universe is dominated by a configuration that can be 
described in classical geometric terms. It is in this limit that the universe appears as three 
dimensional. Configurations which are in general non three-dimensional, non-geometric, 
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possibly tachyonic, and, in any case, configurations for which General Relativity and Einstein 
Equations don't apply, are covered under the "un-sharping" relations of the Uncertainty 
Principle. All of them are collectively treated as "quantum effects" ; 

ii) In the classical limit, nothing travels at a speed higher than c. As during an experiment 
no information comes from outside the local horizon set by the duration of the experiment 
itself, to cause some (classical) effects on it, any consideration about the entropy of the 
configuration of the object under consideration can be made "local" (tachyonic effects are 
taken into account by quantization). That means, when we consider the motion of an object 
along space we can just consider the local entropy, which depends on, and is determined by, 
the energy distribution around the object. 

Having these considerations in mind, let us consider the motion of a particle, or, more 
precisely, a non-dispersive wave-packet, in the three-dimensional, classical space. Consider to 
perform a (generally point-wise) coordinate transformation to a frame in which the metric of 
the energy distribution external to the system intrinsically building the wave packet in itself 
is flat, or at least remains constant. As seen from this set of frames, along the motion there is 
no change of the (local) entropy around the particle, and the right hand side of 15 .151 vanishes, 
implying that also the metric of the motion itself remains constant (remember that 15.151 in 
this case gives the ratio between metrics at different points/times). This means that the 
metric of the energy distribution and the metric of the motion are the same, and proves the 
equivalence of !5.13l and 15. 151 with the Einstein's equations 15. Ill If on the other hand we keep 
the frame of the observer fixed, and we ask ourselves what will be the direction chosen by the 
particle in order to decide the steps of its motion, the answer will be: the particle "decides" 
stepwise to go in the direction that maximizes the entropy around itself. Let us consider 
configurations in which the only property of particles is their mass (no other charges), so 
that entropy is directly related to the "energy density" of the wave packet. In this case, 
between the choice of moving toward another particle, or far away, the system will proceed 
in order to increase the energy density around the particle. Namely, moving the particle 
toward, rather than away from, the other particle, in order to include in its horizon also the 
new system. This is how gravitational attraction originates in this theoretical framework. 

In order to deal with more complicated cases, such as those in which particles have 
properties other than just their mass (electro-magnetic/weak/strong charge), we need a 
more detailed description of the phase space. In principle things are the same, but the 
appropriate scenario in which all these aspects are taken into account is the one in which 
these issues are phrased and addressed within a context of (quantum) String Theory. This 
analysis, first presented in Ref. j9], is discussed in detail in Ref. [2]. 

5.5 The metric around a black hole 

Let us consider once more the general expression relating the evolution of a system as is seen 
by the system itself, indicated with A', and by an external observer, A, expressions 15.11 and 
15.21 In the large-scale, classical limit, the variations of entropy AS(A) and AS(A') can be 
written in terms of time intervals, as in !5.5l and l5.6[ in which t and t' are respectively the time 
as measured by the observer, and the proper time of the system A'. In this case, as we have 
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seen expression 15.21 can be written as (At') 2 = (At) 2 — (AS' cxtcrnal (t)) (see expression 15.131) . 
and the temporal part of the metric is given by: 

#00 - 1. (.0.18J 

As long as we consider systems for which g Q o is far from its extremal value, expression 15.181 
constitutes a good approximation of the time component of the metric. However, a black 
hole does not fall within the domain of this approximation. According to its very (classical) 
definition, the only part we can probe of a black hole is the surface at the horizon. In the 
classical limit the metric at this surface vanishes: goo — > (an object falling from outside 
toward the black hole appears to take an infinite time in order to reach the surface). This 
means, 

(A5 extemaJ ) « oc (At) 2 . (5.19) 

However, in our set up time is only an average, "large scale" concept, and only in the 
large scale, classical limit we can write variations of entropy in terms of progress of a time 
coordinate as in l5.5l and l5.6l The fundamental transformation is the one given in expressions 
15.11 15.21 and the term goo has only to be understood in the sense of: 

AS(A') — ► (AS{A')) = At'gooAt' . (5.20) 

The apparent vanishing of the metric 15.181 is due to the fact that we are subtracting contri- 
butions from the first term of the r.h.s. of expression 15.21 namely AS(A'), and attributing 
them to the contribution of the environment, the world external to the system of which we 
consider the proper time, the second term in the r.h.s. of 15.21 AS cxtC ma,\(A). Any physical 
system is given by the superposition of an infinite number of configurations, of which only the 
most entropic ones (those with the highest weight in the phase space) build up the classical 
physics, while the more remote ones contribute to what we globally call "quantum effects" . 
Therefore, taking out classical terms from the first term, AS (A'), the "proper frame" term, 
means transforming the system the more and more into a "quantum system" . In particular, 
this means that the mean value of whatever observable of the system will receive the more 
and more contribution by less localized, more exotic, configurations, thereby showing an 
increasing quantum uncertainty. In particular, the system moves toward configurations for 
which Ax — 1/Ap. Indeed, one never reaches the condition of vanishing of 15.201 because, 
well before this limit is attained, also the notion itself of space, and time, and three dimen- 
sions, localized object, geometry, etc., are lost. The most remote configurations in general 
do not describe a universe in a three-dimensional space, and the "energy" distributions are 
not even interpretable in terms of ordinary observables. At the limit in which we reach 
the surface of the horizon, the black hole will therefore look like a completely delocalized 
object [I]. 

5.6 Natural or real numbers? 

The approach we are proposing, and the fact that from the collection of binary codes we arrive 
to the structures of our physical world, implies a question about what is after all the world we 
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experience. We are used to order our observations according to phenomena that take place in 
what we call space-time. An experiment, or, better, an observation (through an experiment), 
any perception in itself, basically consists in realizing that something has changed: our "eyes" 
have been affected by something, that we call "light" , that has changed their configuration 
(molecular, atomic configuration). This light may carry information about changes in our 
environment, that we refer either to gravitational phenomena, or to electromagnetic ones, and 
so on... In order to explain them we introduce energies, momenta, "forces", i.e. interactions, 
and therefore we speak in terms of masses, couplings etc... However, all in all, what all 
these concepts refer to is a change in the "geometry" of our environment, a change that 
"propagates" to us, and eventually results in a change in our brain, the "observer". But 
what is after all geometry, other than a way of saying that, by moving along a path in 
space, we will encounter or not some modifications? Assigning a "geometry" is a way of 
parametrizing modifications. Is it possible then to invert the logical ordering from reality 
to its description? Namely, can we argue that what we interpret as energy, or geometry, 
is simply a code of information 0. Something happens, i.e. time passes, when some code 
changes. Viewed in this way, it is not a matter of mapping physical degrees of freedom 
into a language of abstract codes, but the other way around, namely: perhaps the deepest 
reality is "information", that we arrange in terms of geometries, energies, particles, fields, 
and interactions. When we "see" the universe, we interpret the codes in terms of maps, from 
a space of "energies" to a target space, that take the "shape" of what we observe as the 
physical reality. From this point of view, information is not just something that transmits 
knowledge about what exists, but it is itself the essence of what exists, and the rationale 
of the universe is precisely that it ultimately is the whole of rationale. The quantum (in 
the sense of indeterministic) nature of the universe is then the consequence of being any 
observable not just a code but a collection, a superposition, of codes. 

Reducing everything to a collection of binary codes means reducing everything to a 
discrete description in terms of natural numbers, i.e. to saying that the whole of rationale 
is numerable. One may wonder whether natural numbers are enough to encode all the 
information of the universe. At first sight, one would say that real numbers say "more", 
allow to express more information. Moreover, they appear to be "real" in the true sense of 
something existing in nature. For instance, one can think to draw with the pencil a circle 
and a diameter. Then, one has physically realized two lines of lengths that don't stay in 
a ratio expressible as a rational number. However here the point is: what is really about 
the microscopical nature of these two drawings? At the microscopical level, at the scale 
of the Planck length, the notion of space itself is so fuzzy to be practically lost. In our 
scenario, an analysis of the superposition of configurations tells us that, before reaching this 
scale, remote configurations, whose contribution is usually collected under the Heisenberg's 
Uncertainty, count more and more. In other words, the world is no more classical but 
deeply quantum mechanical, to the point that the uncertainty in the length of the two 
lines doesn't allow us to know whether their ratio is a real or a rational number. In this 
sense, this analysis provides further support to an old idea which goes back to Konrad Zuse, 
that all the information of the universe is expressible through natural numbers, and, as 

24 See for instance the "it from bit" of J. A. Wheeler, and the work of C. F. Weizsacker. 
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a consequence, the discrete description of the universe, and in particular of space-time, is 
not just an approximation, but indeed the most fundamental one can think about. After 
all, real numbers are introduced in mathematics through definitions and procedures, whose 
informational content can be "written" as a text with a computer program. This means 
that, as a matter of pure information content, real numbers can be introduced via natural 
numbers. 
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